# Lightweight Visual Question Answering

Moondream 2b 2025 04 14 4bit
Apache-2.0
Moondream is a lightweight vision-language model designed for efficient cross-platform deployment. The 4-bit quantized version, released on April 14, 2025, significantly reduces memory usage while maintaining high accuracy.
Image-to-Text Safetensors
moondream
6,037
38
Dermatech Qwen2 VL 2B GGUF
This is a multimodal model based on the Qwen2 architecture, supporting text generation, image-to-text, and visual question answering tasks, with multiple quantized versions to meet diverse needs.
Image-to-Text English
mradermacher
42
0
Qwen2 VL 2B Instruct GGUF
Apache-2.0
Qwen2-VL-2B-Instruct is a 2B-parameter multimodal vision-language model based on the Qwen2 architecture that supports image-text generation tasks.
Image-to-Text English
second-state
125
3
Tinyllava 1.1b V0.1
Apache-2.0
A lightweight visual question answering model based on TinyLlama-1.1B, trained using the BakLlava codebase, supporting image content understanding and question-answering tasks.
Text-to-Image Transformers
TitanML
27
0
Tinyllava 1.1b V0.1
Apache-2.0
A lightweight visual question answering model based on TinyLlama-1.1B, trained using the BakLlava codebase.
Text-to-Image Transformers
0xAmey
16
21